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Abstract. Sharpening (a particular case of) a result of Szemeredi and Vu |SV06| 
and extending earlier results of Sarkozy [S89J and ourselves |L97bj . we find, sub- 
ject to some technical restrictions, a sharp threshold for the number of integer sets 
needed for their sumsct to contain a block of consecutive integers of length, com- 
parable with the lengths of the set summands. 

A corollary of our main result is as follows. Let k,l ^ 1 and n ^ 3 be integers, 
and suppose that A\,...,Ak C [0,1] are integer sets of size at least n, none of 
which is contained in an arithmetic progression with difference greater than 1. If 
k ^ 2 \(l — 1)/ (n — 2)] , then the sumset A±+- ■ -+Ak contains a block of consecutive 
integers of length k(n — 1). 



1. Background and summary of results 

The sumset of subsets Ai, . . . , of an additively written group is denned by 

AH h A k := {ai H h a k : ai 6 A h . . . , a k e A k }; 

if A\ — ■ ■ ■ — A}. — A, this is commonly abbreviated as kA. In the present paper we 
will be concerned exclusively with the group of integers, in which case a well-known 
phenomenon occurs: if all sets A» are dense, and their number k is large, then the 
sumset A\ + • • • + Ak contains long arithmetic progressions. There are numerous 
ways to specialize this statement, indicating the exact meaning of "dense", "large", 
and "long" , but in our present context the following result of Sarkozy is the origin of 
things. 

Theorem 1 (Sarkozy [S89[ Theorem 1]). Let I ^ n ^ 2 be integers, and write 



k := \(l + l)/(n— 1)]. Then, for every integer set A C [1,/] with \A\ = n, there 
exist positive integers d ^ k — 1 and k < 118k such that the sumset kA contains I 
consecutive multiples of d. 

In [L97bJ we established a sharp version of this result, replacing the factor 118 
with 2 (which is the best possible value, as conjectured by Sarkozy) and indeed, 
going somewhat further. 
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Theorem 2 (Lev |L97bl Theorem 1]). Let n ^ 3 and I ^ n — 1 6e integers, and write 
k := — — 2) J . Then for every integer set A C [0, /] with 0,1 £ A, gcd(A) = 

1, and \A\ = n, and every integer k ^ 2k, we have 

[k(21 - 2 - (« + l)(n - 2)), fcZ - /c(2Z - 2 - (« + l)(n - 2))] C fcA 

The complicated-looking interval appearing in the statement of Theorem [2] is best 
possible and, in general, cannot be extended even by 1 in either direction. As the 
interested reader will easily check, this interval (strictly) includes [kI, (k — k)1\ as a 
subinterval; consequently, if k ^ 2k + 1, then its length exceeds /. At the same time, 
if k ^ 2k, then the sumset kA may fail to contain a block of consecutive integers of 
length I; see example below. Thus, 2k + 1 is the smallest value of k such that, with 
A as in Theorem [2j the sumset kA is guaranteed to contain a block of consecutive 
integers of length /. 

At first sight, Theorem [2] is weaker than Theorem [1] in imposing the extra as- 
sumptions 0,1 e A and gcd(A) = 1. It is explained in [L97b| . however, that these 
assumptions are merely of normalization nature, and a refinement of Theorem [H with 
the bound 118k replaced by 2k + 2 (and some other improvements), is deduced from 
Theorem [2] in a relatively straightforward way. 

We notice that Theorem [2] yields a sharp result about the function f(n, k, I), intro- 
duced in [Sy06j. This function is defined for positive integers k and I ^ n ^ 2 to be 
the largest number /, such that for every n-element integer set A C [1, 1], the sumset 
kA contains an arithmetic progression of length /. (The length of an arithmetic pro- 
gression is the number of its terms, less 1). As indicated in [SV06j . "many estimates 
for f(n, k, I) have been discovered by Bourgain, Freiman, Halberstam, Green, Ruzsa, 
and Sarkozy" . It is worth noting in this connection that Theorem [2] establishes the 
exact value of this function for k large; namely, it is easy to deduce from Theorem [2] 
(and keeping in mind the trivial example A = [l,n]) that 

f(n,k,l) = k(n - 1); k ^ 2 [{I - 2)/{n - 2)J + 2. 

This, to our knowledge, remains the only situation where the value of f(n,k,l) is 
known precisely. 

An obvious shortcoming of Theorem [2] is that it applies only to identical set sum- 
mands. Potentially distinct summand are dealt with by Szemeredi and Vu in [Sy06j . 
A particular case of their result, to be compared with Theorems [1] and [2} is as follows. 
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Theorem 3 (Szemeredi-Vu |SV06I particular case of Corollary 5.2]). There exist 
positive absolute constants C and c with the following property. Suppose that k, I ^ 1 
and n ^ 2 are integers, and A\,...,Ak Q [0,1] are integer sets, having at least n 
elements each. If k > Cl/n, then the sumset A\ + ■ • - + contains an arithmetic 
progression of length at least ckn. 

Though the proof of Theorem [3l presented in |SV06] . is constructive, the constants 
C and c are not computed explicitly. Indeed, the argument leads to excessively large 
values of these constants, which may present a problem in some applications. 

The goal of this note is to merge the best of the two worlds, extending Theorem [2] 
onto distinct set summands, or, equivalently, proving a sharp analogue if Theorem [31 
with Cl/n and ckn replaced with best possible expressions. A result of this sort (up 
to a technical restriction, addressed below) is an almost immediate corollary from the 
following theorem, proven in Section [2j 

Theorem 4. Let k, I ^ 1 andn ^ 3 be integers, and write k := [(/ — l)/(n — 2)] — 1. 
Suppose that Ai, . . . , A 2k+ \ C [0, /] are integer sets, having at least n elements each, 
and such that none of them are contained in an arithmetic progression with difference 
greater than 1. Then the sumset A\ + • - • + ^42k+i contains a block of consecutive 
integers of length 2(k + l)(n — 1) — I > I. 

Observe, that Theorem H] guarantees the existence of a block of consecutive integers 
of length I in A\ + ■ ■ ■ + A^ for k — 2k + 1, which is a sharp threshold: for k = 2k the 
sumset may fail to contain such a block, as witnessed, say, by the system of identical 

sets 

A 1 = ■ ■ ■ = A 2K = [0, m] U [I - m, I] 
with m G [1,1/2) integer and n = 2m + 2. Indeed, in this case we have 

2k 

A 1 + ■ ■ • + A 2K = |J [j(l - m),j(l -m) + 2kw\, 

j=0 

the length of each individual segment being 2/tm = k(u — 2) < /, and the segments 
not abutting, provided {(/ — l)/(n — 2)} > 1/2. 

Unfortunately, we were unable to eliminate the assumption that none of the sets 
Aj are contained in an arithmetic progression with difference greater than 1. This 
reminiscent of the condition gcd(v4) = 1 from Theorem [2] will be discussed in Section[3j 

The reader may compare the following corollary against Theorem [3j 
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Corollary 1. Suppose that k, I ^ 1 and n ^ 3 are integers, and A\, . . . , A k Q [0, /] are 
integer sets, having at least n elements each, and such that none of them are contained 
in an arithmetic progression with difference greater than 1. Ifk ^ 2 \{l — l)/(n — 2)] , 
then the sumset Ax+- ■ -+A k contains a block of consecutive integers of length k(n — l). 

Proof. Reducing the value of I and renumbering the sets, if necessary, we can assume 

that 0, 1 G A k . Write k := \{l - l)/(n - 2)] - 1. By TheoremiJ the sumset A\-\ h 

A 2k+ i contains a block of consecutive integers of length at least 2(k+ l)(n — 1) —l>l. 
Adding one by one the sets A 2k + 2 , ■ ■ ■ , A k -i to this sumset, we increase the length of 
the block by at least n — 1 each time, and adding A k at the last step we increase the 
length by I. Consequently, A\ + ■ — h A^ contains a block of length at least 

(2(k + l)(n - 1) - I) + {k - 2k - 2)(n -l) + l = k(n- 1). 

□ 

2. Proof of Theorem H] 

For a finite, non-empty integer set A let £(A) denote the difference of the largest 
and the smallest elements of A. 

Our approach is fairly close to that employed in [L97bj . with the following result 
in its heart. 

Theorem 5 (Lev [L97a, Theorem 1]). Let k ^ 2 be an integer, and suppose that 
Ai, . . . ,Ak are finite, non-empty integer sets. If £(Aj) ^ £{Ak) for j = 1, . . . , k — 1 
and Ak is not contained in an arithmetic progression with difference greater than 1, 
then 

\At H h A k \ ^ \A X H h A k _ x \ + min{£(A fc ), m H h n k - k + 1}, 

where 

a, | if e^) < i{A k ) _ 

Aj\-\ if l{A j )=l(A k )' 

Up to some subtlety which we suppress for the moment, the strategy pursued 
below is to apply Theorem [5] to show that if k is large enough (which in practice 
means k ^ 2k, + 0(1)), then the densities of the sumsets A\ + ■ ■ ■ + Ay k / 2 \ and 
A[k/2\+i + • • • + A k exceed 1/2, and then use the box principle to conclude that the 
sumset of the two, which is Ai+- ■ -+A k , contains a long block of consecutive integers. 
We start with the second, technically simpler, component. 
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Lemma 1. Let L\ and L 2 be positive integers and suppose that Si C [0, L±] and 
S 2 Q [0,-^2] are integer sets. Ifm&x{Li, L 2 } ^ |Si| + \S 2 \ — 2, then 

[L x + L 2 - (| ft I + \S 2 \ - 2), + \S 2 \ -2}CS l + S 2 . 

Proof. Given an integer g G [0, L\ + L 2 ], the number of representations g = si + s 2 
with arbitrary s\ G [0, Li] and s 2 G [0, L 2 ] is 

\(g - [o,Li]) n [0, L 2 ] I = I [<? - Li , g ] n [o, L 2 ] | 

= min{g, L 2 } — max{g — L\, 0} + 1 
= min{#, L 2 } + min{#, L x } - # + 1. 

In order for g G S\ + 5" 2 to hold, it suffices that this number of representations exceeds 
the total number of "gaps" in S\ and S 2 ; that is, 

min{g, L 2 } + min{g, Li} - g + 1 > L]. + L 2 + 2 — \Si\ — \S 2 \. 

This, however, follows immediately for every g in the interval [Lx + L 2 — (\Si \ + \S 2 \ — 
2), \ Si \ + \S 2 \ — 2], by considering the location of g relative to L\ and L 2 . □ 

We now turn to the more technical part of the argument, consisting in inductive 
application of Theorem [5j 

Proposition 1. Let k,l ^ 1, and n ^ 3 be integers, and suppose that A\,...,Af. 
are integer sets with \Ai\ ^ n and £{Aj) ^ I for i = 1, . . . , k, such that none of these 
sets are contained in an arithmetic progression with difference greater than 1. Write 
S = A 1 + --- + A k . 

(i) If k ^ (Z-l)/(n-2)-l, then \S\ ^ \ {£{S) + (k + l)(n - 1) - I + 2); 

(ii) ifk>{l- \)/{n - 2), then \S\ > \ (£(S) + k(n - 1) + 2). 

Observe, that under assumption (i) we have (k + l)(n — 1) — I + 2 > 0, so that 
\S\ > £(S)/2 in this case. Similarly, under assumption (ii) we have k(n — 1) + 2 > I, 
and hence in this case the stronger estimate |>S| > (£(S) + 1)/2 holds. 

Proof of Proposition^ Write lj = £{Aj). Without loss of generality, we can assume 
that h ^ ■ ■ ■ ^ h- By Theorem [5] and in view of £(S) = li + • • • + 4, we have 

k 

\S\-~£(S)>'£(mm{l j -l,j(n-2)} + l)+l-'t-(l 1 + ... + l k ) 

= ^ min |i_l )j(n _2)-^| + | +1. (1) 
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Starting with assertion (ii), assume that k ^ (Z — l)/(n — 2). Color all integers 
j G [1, k] red or blue, according to whether lj — 1 ^ j(n — 2) or Zj — 1 < j(n — 2). Notice 
that the integer 1 is then colored red, hence the interval [1, k] can be partitioned into 
a union of adjacent sub intervals J\ U • • ■ U Jk so that each subinterval consists of a 
block of consecutive red integers followed by a block of consecutive blue integers, with 
all blocks non-empty — except that the rightmost subinterval Jk may consist of a 
"red block" only. Accordingly, we write the sum over j £ [1, k] in the right-hand side 
of ([I]) as o\ + • ■ ■ + ok, where for each v 6 [1, if] by o v we denote the sum over all 
j e J„. 

Fixing z/ e [1, if], write Jj, = [s, £] and define q to be the largest red-colored number 
of the interval [s,t — 1]. This does not define q properly if v = K and s = t = k; 
postponing the treatment of this exceptional case, suppose for the moment that q 
is well defined. Thus either q + 1 is blue, whence l q+ i — 1 < (q + l)(n — 2), or 
v = K, t = k, and q = k — 1, whence 

l q+1 - 1 = l k - 1 ^ Z - 1 ^ k(n - 2) = (q + l)(n - 2). 

Observe, that 

Z g+ i-l ^ (g + l)( n -2) (2) 
holds in either case, and it follows that 

j=s ^ ' j=q+l 

>^tf + Q-s 2 + s)- l -^(q+l-s) + l -^(t-q) 

= ^ (q 2 + q-s 2 + s) + (s + £ - 1 - 2q). 

We now distinguish two cases: q ^ (s + £ — l)/2 and q ^ (s + £ — l)/2. In the former 
case we have 

n — 2 

0V ^ — ^— (g 2 + g - s 2 + s + q(s + £ - 1 - 2g)) 
= r ^ L {q{s + t-q)-s 2 + s) 

= (st + (q- s)(t -q)-s 2 + s) 

n — 2 . 9 
^ — ^— (s£ - s 2 + s) 

n-2 , , , 
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In the latter case, taking into account that l q — 1 ^ l q+ i — 1 ^ (q + l)(n — 2) by (J2]), 
we obtain the same estimate: 

n — 2 

<7„ > — ^— (g 2 + q - s 2 + s + (q + l)(s + t - 1 - 2q)) 
n — 2 

= — ^— (q(s + t-2-q)-s 2 + 2s-l + t) 

> ((* - - 1) - s 2 + 2s - 1 + t) 

n — 2 . 9 . 
= («* - s 2 + s) 

n-2 . . 
= —*\U 

Addressing finally the situation where v = K and s = t = k, we observe that in this 



case 



o v = k(n - 2) - lk - 
^ fc(n - 2) - 



2 

fc(n - 2) 
2 

n-2 



as above. Thus, 



n — 2 n — 2 

(7i + -.. + <7jf ^ — ^— (|Ji| + ••• + \J K \) = , 

and substituting this into (JTJ) we obtain assertion (ii). 

To prove assertion (i), instead of k ^ (Z — l)/(n — 2) assume now the weaker bound 
k > (Z - l)/(n - 2) - 1. Set Z fe+ i = Z fe , so that l k+1 - l<Z-l<(jfc + l)(n - 2). 
From ((Tj) we get 

fc+i 



|5| - ^(5) > f>in - 2) - 

7 = 1 ^ ' 



4+1 ~ 1 + k 

2 2 



j-- 

Since k+1 ^ (Z — l)/(n — 2), the sum over j can be estimated as above, and it is at 
least (k + l)(n — 2)/2. Assertion (i) now follows in view of 

(fc + l)(n-2) l k+1 -l k 
2 2 2 

^l((k+ l){n - 2) - (Z - 1) + k + 2) = ]- ({k+ l)(n - 1) - Z + 2). 

□ 
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Proposition [T] took us most of the way to the proof of Theorem HI 

Proof of Theorem^ Assume that the sets A±, . . . , A 2k+ i are so numbered that, letting 
lj = £(Aj) for j G [1, 2k + 1], we have Z x ^ Z 2 ^ • • • ^ / 2k +i. We are going to partition 
our sets into two groups to apply Lemma [T] to the sumsets S\ and £2 of these groups, 
and this is to be done rather carefully as effective application of the lemma requires 
that Si and S 2 be of nearly equal length. 

Accordingly, we let Si := A\ + A s + • • • + A 2k -i (including all sets A,- with odd 
indices j < 2k) and S 2 := (A 2 + A^ + ■ — h A 2k ) + A 2k+1 (all sets Aj with even indices 
j ^ 2k and A 2k+ i). By Proposition [T] we have 

\Si\>\ (KSi) + (« + l)(n - 1) - Z + 2) (3) 

and 

|S 2 | ^(£(£ 2 ) + (K + l)(n-l) + 2). (4) 

Furthermore, from 

i(Si) - £(S 2 ) = (h - l 2 ) + ■ ■ ■ + (Z 2K _! - l 2K ) - l 2K+ i G [-1, 1], 
using ([3]) and (TJJ we get 

msx{£(Si),e(S 2 )} < ~ (^(Si) + + z) 

< |Si| + |5 2 | + Z - (« + l)(n - 1) - 2 < | Si | + |5 2 | - 2. 

Applying Lemma [1] and using again (j3J) and (TjJ, we conclude that A x + • • • + A k = 
Si + S 2 contains a block of consecutive integers of length at least 

2(|Si| + \S 2 \ - 2) - (£(Si) + £(S 2 )) > 2(« + l)(n - 1) - Z, 

as required. □ 

3. Concluding remarks and open problems 

The major challenge arising in connection with the main results of this paper (which 
are Theorem 0] and Corollary [1]) is to get rid of the assumption that none of the sets 
involved are contained in an arithmetic progression with difference greater than 1. 
One can expect that a vital ingredient of such an improvement would be a suitable 
generalization of Theorem Indeed, we were able to generalize Theorem in what 
seems to be the right direction. 
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Theorem 5'. Let k ^ 2 be an integer, and let Ai, . . . , A k be finite, non-empty integer 
sets. Suppose that I G A k — A k is a positive integer, and define d to be the largest 
integer such that A k is contained in an arithmetic progression with difference d. Then 

\A X H h A k \ ^ |Ai H h A fe _i| + min{hl/d, n x H V n k - k + 1}, 

where h is the number of residue classes modulo d, represented in A x + • • • + A k _i, 
and nj (j = 1, . . . , k) is the number of residue classes modulo I, represented in Aj. 

Clearly, Theorem I5 7 ] implies Theorem [5] and, furthermore, shows that the assump- 
tion of Theorem H] that none of the sets Aj are contained in an arithmetic progression 
with difference greater than 1 can be slightly relaxed. Specifically, it suffices to re- 
quest that the Aj can be so ordered that £(Aj) increase, and for every k G [1, 2k] the 
sumset Ai + ■ ■ ■ + A k represents all residue classes modulo gcd(A k+1 — A k+ i). It seems, 
however, that Theorem by itself fails short to extend Theorem H] the desired way, 
dropping the modular restriction altogether, and in the absence of an application we 
do not present here the proof of the former theorem. 

Another interesting direction is to refine Theorem H] as to the length of the block, 
contained in the sumset A x + • • • + A 2k +i- While we observed that 2k + 1 is the 
smallest number of summands which ensures a block of length /, it is quite possible 
that the existence of a longer block can be guaranteed. In this connection we mention 
the following conjecture from |L97bj . referring to the equal summands situation. 

Conjecture 1. Let k, I ^ 1 and n ^ 3 be integers, and write k := [(l — l)/(n — 2)J . 
Suppose that A C [0,/] is an integer set with 0,1 E A, gcd(A) = 1, and \A\ = n. If 
k ^ 2k + 1, then kA contains a block of consecutive integers of length (k — K,)l + k((K + 
\){n -2) + 2-1). 

In fact, Conjectured] is established in [L97bj in the case where k ^ 3k, but the case 
2k + 1 ^ k < 3k, to our knowledge, remains open. 
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